CONFLICT AMONG TESTING PROCEDURES?


1.  Introduction 

Savin [1976] and Berndt and Savin [1977], among others, have
pointed out that an inequality relation exists among the Lagrange
Multiplier test (LM), the Wald test (W), and the Likelihood Ratio
test (LR). However, since all three tests converge to the same
limiting Chi-square distribution, they are usually compared against
the same critical value. This raises the possibility of conflicting
conclusions from the three tests.

Kohler  [1979]  and  Vandaele  [1981]  have  shown  that  the  three  tests 
are  monotonic  functions  of  each  other.  This  implies  that  they  have 
identical  power  characteristics.  In  particular,  if  the  probabilities 
of  Type  I  errors  are  equal  among  the  three  tests,  they  have  to  have 
the  same  probability  of  Type  II  errors  as  well.  In  essence  we  are 
dealing  with  one  and  the  same  test. 

In  this  paper  we  review  briefly  how  the  tests  are  related  and 
why the inequality relation exists. We then derive criteria which
allow  us  to  determine  which  test  is  more  appropriate  in  a  given 
situation.  This  should  resolve  possible  conflicts  for  at  least  some 


2.  The  Three  Tests  and  the  Inequality  Among  Them 


Consider the model:

(1)  $Y = X\beta + \varepsilon$

(2)  $\varepsilon \sim N(0, \sigma^2 I)$.

Let $\hat\beta$ and $\hat\sigma^2$ be the maximum-likelihood estimators
obtained by unconstrained maximization of the likelihood function, and
$\tilde\beta$ and $\tilde\sigma^2$ the corresponding estimators obtained
by maximizing the likelihood function subject to the constraint
$R\beta = r$. Furthermore, we shall need an estimator for the Lagrange
multiplier ($\mu$) and the ratio of the constrained to unconstrained
maxima of the likelihood function ($\lambda$).

The three test statistics can be written as:

(3)  $LR = -2 \log \lambda$

(4)  $W = [(r - R\hat\beta)'(R(X'X)^{-1}R')^{-1}(r - R\hat\beta)]/\hat\sigma^2$

(5)  $LM = [\tilde\mu'(R(X'X)^{-1}R')\tilde\mu]/\tilde\sigma^2$.

To simplify the notation let $A = R(X'X)^{-1}R'$. From the first-order
conditions for maximizing the likelihood function subject to the
constraint we can obtain an expression for $\tilde\mu$:

(6)  $\tilde\mu = (R(X'X)^{-1}R')^{-1}(r - R\hat\beta)$.




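We can now rewrite Eqs. (4) and (5) as

(7)  $W = n[(r - R\hat\beta)'A^{-1}(r - R\hat\beta)]/\hat\sigma^2$

(8)  $LM = n[(r - R\hat\beta)'A^{-1}(r - R\hat\beta)]/\tilde\sigma^2$.

Recall that $\tilde\sigma^2 = \hat\sigma^2 + (r - R\hat\beta)'A^{-1}(r - R\hat\beta)$
and $\lambda = (\tilde\sigma^2/\hat\sigma^2)^{-n/2}$. Adopting the
shorthand notation $\theta$ for $\tilde\sigma^2/\hat\sigma^2$ we can
rewrite the three tests as:

(9)  $LR = n \cdot \log(\theta)$

(10)  $W = n \cdot (\theta - 1)$

(11)  $LM = n \cdot (1 - 1/\theta)$.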

Given that $\theta \ge 1$ with probability 1, this establishes the
inequality

(12)  $LM \le LR \le W$.
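As a numerical illustration of Eqs. (9) through (11), the following minimal Python sketch computes the three statistics from the ratio $\theta$ and checks the ordering in Eq. (12). The function name `lm_lr_wald` and the input values (residual sums of squares of 120 and 100, n = 50) are hypothetical and chosen only for illustration. The sketch assumes that both variance estimators are residual sums of squares divided by n, so that $\theta$ reduces to the ratio of the two sums of squares.

```python
import math


def lm_lr_wald(rss_restricted, rss_unrestricted, n):
    """Return (LM, LR, W) computed from Eqs. (9)-(11).

    Assumption: both variance estimators divide the residual sum of
    squares by n, so theta is simply the ratio of the two sums.
    """
    if rss_unrestricted <= 0 or rss_restricted < rss_unrestricted:
        raise ValueError("need 0 < rss_unrestricted <= rss_restricted")
    theta = rss_restricted / rss_unrestricted
    lr = n * math.log(theta)        # Eq. (9)
    w = n * (theta - 1.0)           # Eq. (10)
    lm = n * (1.0 - 1.0 / theta)    # Eq. (11)
    return lm, lr, w


# Hypothetical inputs, for illustration only.
lm, lr, w = lm_lr_wald(rss_restricted=120.0, rss_unrestricted=100.0, n=50)
print(f"LM = {lm:.3f}, LR = {lr:.3f}, W = {w:.3f}")
assert lm <= lr <= w  # the ordering of Eq. (12)
```

With these inputs $\theta = 1.2$, and the three values differ noticeably even though they are transformations of the same quantity.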




3.  Which  Tests  Should  Be  Used?


Equations (9) through (11) allow us to express any one of the
three tests as a function of any other. Furthermore, we can
transform the W statistic into an F statistic by a standard
degrees-of-freedom adjustment. Thus for small samples we can express
any one of the three tests as a transformation of an F test for which
exact critical values can be calculated.
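The transformations described above can be sketched in a few lines of Python. The sketch assumes the usual linear-model F statistic with q and n - k degrees of freedom, where k is the number of estimated coefficients and q the number of constraints (the symbols the paper uses later in this section), so that W = nqF/(n - k). The function names and the numerical inputs are hypothetical.

```python
import math


def wald_to_f(w, n, k, q):
    """Degrees-of-freedom adjustment: F = W (n - k) / (n q)."""
    if n <= k or q <= 0:
        raise ValueError("need n > k and q > 0")
    return w * (n - k) / (n * q)


def f_to_statistics(f, n, k, q):
    """Map an F value to the W, LR and LM values it implies."""
    w = f * n * q / (n - k)      # invert the adjustment above
    theta = 1.0 + w / n          # from Eq. (10): W = n (theta - 1)
    return {
        "W": w,
        "LR": n * math.log(theta),        # Eq. (9)
        "LM": n * (1.0 - 1.0 / theta),    # Eq. (11)
    }


# Illustrative value only: W = 10 with n = 50, k = 5, q = 1.
f_value = wald_to_f(10.0, n=50, k=5, q=1)
print(f_value, f_to_statistics(f_value, n=50, k=5, q=1))
```

Passing an exact F critical value (taken from an F table for q and n - k degrees of freedom) through `f_to_statistics` would give the critical value for each of the three tests that corresponds to the same exact significance level.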

However, few researchers go through all that trouble. Most
commonly, the value of the test statistic is simply compared to the
critical value $c_\alpha$ obtained from a standard Chi-square table.
This is the source of possible conflicts, since the three tests differ
numerically but are compared against the same critical value.

Let $c_\alpha$ be the $\alpha$ percent critical value of the Chi-square
distribution, i.e., $\Pr\{\chi^2 > c_\alpha\} = \alpha$. We can now
calculate the probability of a Type I error for any one of the tests.
In particular we have:

(13)  $P_I(W) = \Pr\{W > c_\alpha\}$

(14)  $P_I(LM) = \Pr\{LM > c_\alpha\}$.

By expressing LM as a function of W we get

(15)  $P_I(LM) = \Pr\{W/(1 + W/n) > c_\alpha\}$

      $= \Pr\{W(1 - c_\alpha/n) > c_\alpha\}$

      $= \Pr\{W > c_\alpha/(1 - c_\alpha/n)\} < P_I(W)$.
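The equivalence used in Eq. (15) can be checked numerically. The sketch below is only an illustration under assumed values: n = 50 is hypothetical, while 3.841 is the 5 percent Chi-square critical value for one constraint quoted in this paper. It confirms that LM exceeds $c_\alpha$ exactly when W exceeds $c_\alpha/(1 - c_\alpha/n)$, and it shows the range of W values in which the two tests disagree.

```python
def lm_from_w(w, n):
    """LM as a function of W, from Eqs. (10) and (11): LM = W / (1 + W/n)."""
    return w / (1.0 + w / n)


def wald_threshold_for_lm(c_alpha, n):
    """Value that W must exceed for LM to exceed c_alpha (requires c_alpha < n)."""
    if c_alpha >= n:
        raise ValueError("c_alpha must be smaller than n")
    return c_alpha / (1.0 - c_alpha / n)


n = 50            # hypothetical sample size
c_alpha = 3.841   # 5 percent Chi-square critical value, one constraint
w_star = wald_threshold_for_lm(c_alpha, n)

# Both rejection rules agree for every W once LM is held to w_star.
for w in (3.0, 4.0, 4.5, 6.0):
    assert (lm_from_w(w, n) > c_alpha) == (w > w_star)

# For c_alpha < W <= w_star the Wald test rejects and the LM test does not.
print(f"conflict region: {c_alpha} < W <= {w_star:.4f}")
```

Because the threshold for W implied by the LM test is larger than $c_\alpha$, the LM test rejects less often than the Wald test when both use the same Chi-square critical value.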
We can also establish that $P_I(W) > \alpha$:

(16)  $P_I(W) = \Pr\{W > c_\alpha\}$

      $= \Pr\{F \cdot nq/(n - k) > c_\alpha\}$

      $= \Pr\{F > (c_\alpha/q)(1 - k/n)\} > \alpha$.

Since $c_\alpha/q$ is less than or equal to the critical value of an F
distribution for the same level of significance, and $(1 - k/n) < 1$,
$P_I(W)$ must be larger than $\alpha$. By combining Eqs. (15) and (16)
we can compare $P_I(LM)$ to $\alpha$.


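(17)  $P_I(LM) = \Pr\{W > c_\alpha/(1 - c_\alpha/n)\}$

      $= \Pr\{F \cdot nq/(n - k) > c_\alpha/(1 - c_\alpha/n)\}$

      $= \Pr\{F > (c_\alpha/q)\,[(1 - k/n)/(1 - c_\alpha/n)]\}$.

If the bracketed expression in Eq. (17) is less than or equal to one,
$P_I(LM)$ is larger than $\alpha$. In other words, if $c_\alpha \le k$,
we have the relationship

(18)  $P_I(W) > P_I(LM) > \alpha \qquad (c_\alpha \le k)$.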


The interesting part about this relationship is that it depends not
only on q and k, but also on the level of significance chosen. If
we are estimating five parameters with one constraint, we get the
critical values $c_{\alpha_1} = 3.841$ for $\alpha_1 = .05$, and
$c_{\alpha_2} = 6.635$ for $\alpha_2 = .01$. From Eq. (18) we can
determine that at significance level $\alpha_1$, the LM test is
certainly the more accurate test, i.e., $P_I(LM)$ is closer to the
postulated value of $\alpha_1$ than $P_I(W)$. For significance level
$\alpha_2$ the question is open, and depends on the difference between
$c_\alpha/q$ and the corresponding critical value for the F
distribution, which in turn depends on the sample size.
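The worked example can be reproduced with a short sketch that evaluates the bracketed expression of Eq. (17) for k = 5 and q = 1 at both significance levels. The critical values 3.841 and 6.635 are those quoted above. The sample sizes and the function name `lm_bracket` are hypothetical.

```python
def lm_bracket(c_alpha, k, n):
    """Bracketed factor in Eq. (17): (1 - k/n) / (1 - c_alpha/n)."""
    if n <= max(k, c_alpha):
        raise ValueError("n must exceed both k and c_alpha")
    return (1.0 - k / n) / (1.0 - c_alpha / n)


k, q = 5, 1  # five parameters, one constraint
for alpha, c_alpha in ((0.05, 3.841), (0.01, 6.635)):
    for n in (20, 50, 200):  # hypothetical sample sizes
        bracket = lm_bracket(c_alpha, k, n)
        f_cut_w = (c_alpha / q) * (1.0 - k / n)  # F cut-off implied by W, Eq. (16)
        f_cut_lm = (c_alpha / q) * bracket       # F cut-off implied by LM, Eq. (17)
        print(f"alpha={alpha} n={n} bracket={bracket:.4f} "
              f"F cut-off (W)={f_cut_w:.3f} F cut-off (LM)={f_cut_lm:.3f}")
```

The factor is below one for every admissible n when $c_\alpha \le k$ (here at the 5 percent level) and above one when $c_\alpha > k$ (here at the 1 percent level). In the second case the comparison of $P_I(LM)$ with $\alpha$ requires the exact F critical value for the sample size at hand.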


4.  Conclusions 


We have shown that the well-known inequality relation between
the  LM,  W  and  LR  tests  does  not  need  to  lead  to  conflicting  results. 
Since  the  tests  are  monotonic  functions  of  each  other,  as  well  as  of 
the  familiar  F  test,  we  can  calculate  the  precise  critical  values 
that  will  equate  the  probability  of  a  Type  I  error  for  all  three 
tests.  Under  these  circumstances,  the  three  tests  will  have  exactly 
the  same  probability  of  rejection  and  identical  power.  Conflicting 
results  are  impossible. 

If we use the same critical value for the three tests, i.e., $c_\alpha$
obtained from the Chi-square distribution towards which all three
tests converge, conflicts are possible. However, under certain
circumstances we can show that $P_I(LM)$ is closer to $\alpha$ than
$P_I(W)$ and, by inference, than $P_I(LR)$, which is situated between
$P_I(W)$ and $P_I(LM)$. In such cases we should prefer the LM test,
since it is more accurate, i.e., its probability of rejection is
closer to the postulated value $\alpha$.